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ABSTRACT 

The clustering properties of dark matter halos are a firm prediction of modern theories of structure 
formation. We use two large volume, high-resolution N-body simulations to study how the correlation 
function of massive dark matter halos depends upon their mass, assembly history, and recent merger 
activity. We find that halos with the lowest concentrations are presently more clustered than those 
of higher concentration, the size of the effect increasing with halo mass; this agrees with trends found 
in studies of lower mass halos. The clustering dependence on other characterizations of the full mass 
accretion history appears weaker than the effect with concentration. Using the integrated correlation 
function, marked correlation functions, and a power-law fit to the correlation function, we find evidence 
that halos which have recently undergone a major merger or a large mass gain have slightly enhanced 
clustering relative to a randomly chosen population with the same mass distribution. 
Subject headings: cosmology:theory - methods:numerical - dark matter: merging histories - galaxies: 
clusters 



1. INTRODUCTION 

The observed Universe contains order on all scales we 
can probe. It is generally believed that the largest struc- 
tures arose via the amplification of primordial (quantum) 
fluctuations during a period of accelerated expansion, 
processed by the subsequent 13 Gyrs of gravitational in- 
stability. The pattern of clustering of objects on large 
scales is a calculable prediction of cosmological models 
and thus comprises one of the fundamental cosmological 
statistics. 

Within modern theories of structure formation, the 
clustering of rare, massive dark matter halos is en- 
hanc ed relative to that of the general mass distribu- 
tion llKaiserll 19841 [Efstathiou et al.lll988t iCole fc Kaiserl 
ll989llMo fe Whitdll996HSheth fe Tormenlll999|) . an ef- 
fect known as bias. The more massive the halo, the larger 
the bias. As a result, the mass of halos hosting a given 
population of objects is sometimes inferred by measuring 
their degree of clustering - allowing a statistical route to 
the notoriously difficult p roblem of measur i ng ma sses of 
cosmological objects (e.g. iCoorav fc Shethl 1)20021) 1. 

Since halos of a given mass can differ in their formation 
history and large-scale environment 6 , a natural question 
arises: do these details affect halo clustering? In cur- 
rently viable scenarios for structure formation, objects 
grow either by accretion of smaller units or by major 
mergers with comparable-sized objects. The formation 
history of a halo can thus be characterized by its mass 
accumulation over time, such as when it reached half of 
its mass, had a mass jump in a short time, or last under- 
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went a (major) merger. 

Theoretically, the simp lest descrip t ions o f halo 
grow th and clusteri ng llBond et alJ 119911 [B owci 
|199"T1 ILacev fc Colel 119931 11994 IKitavama fc Sutr 
1996a b) do not give a dep e ndence upon hal o for- 
mation history llWhitd 119931: ISheth fc Tormenl 120041 
iFurlanetto fc Kamionkowskil 120051: lHarker et al.1 12006[) 
To reprise these arguments: pick a random point in the 
universe and imagine filtering the density field around 
it on a sequence of successively smaller scales. The 
enclosed density executes a random walk, which in the 
usual prescription is taken to be uncorrelated from 
scale to scale. The formation of a halo of a given mass 
corresponds to the path passing a certain critical value 
of the density, 8 C , at a given scale. The bias of the 
halo is the 'past' of its random walk and its history 
the 'future' of the walk. All halos of the same mass 
at that time correspond to random walks crossing the 
same point, and thus have the same bias. (Note that 
the derivation, using sharp fc-space filtering, does not 
match the way the prescription is usually applied, and 
this has been suggested by some of the above authors 
as a way to obtain history dependence. Introducing 
an environmental dependence through e.g. elliptical 
collapse will also give a history dependence.) 

The lack of dependence on halo history in the simplest 
descriptions does not close the discussion theoretically 
or otherwise. While these analytic methods work much 
better than might be expected given their starting 
assumptions, the Press-Schechter base d approaches still 
suffer ma ny known difficulties (e.g. ISheth fc Pitmai] 
(|1997|) . iBenson. Kamionkowski fc Hassanil J2005)). 
Other analytical ways of estimating the cluster- 
ing of mergers have been explor ed. For example, 
IFurlanetto fc Kamionkowskil l)2005l) defined a merger 
kernel (not calculable from first principles) and assumed 
that all peaks within a certain volume eventually 
merged. Such an ansatz implies that recently merged 
halos are more clustered for M > M» and less clustered 
for M < M„, with some dependence upon predecessor 
mass ratios and redshifts. (Here M*(z) is the mass at 
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which a(M), the variance of the linear power spectrum 
smoothed on scale M, equals the thre s hold for linear 
density collapse S c (z), see e.g. iPeacoc 3 ©99).) Using 
close pairs as a proxy for recently merged halos, they 
found a similar enhancement of clustering for M > M* 
and reduction for M < M* in several (analytic) cluster- 
ing models. To foreshadow our results: the signals we 
see are consistent with this trend. 

Simple analytic models cannot be expected to capture 
all of the complexities of halo formation in hierarchical 
models, and full numerical simulations are required to 
validate and calibrate the fits. Fortunately, numerical 
simulations are now able to produce samples with suffi- 
cient statistics to test for the depe ndence of clustering on 
forma tion history. Early work bv lLemson fc Kauffmannl 
( 1999) showed that the properties of dark matter halos, 
in particular formation times, are little affected by their 
large-scale environment if the entire population of ob- 
jects is averaged over. They interpreted this as evidence 
against formation history and environment affectin g clus- 
tering. As emphasized bv lSheth fc Tormenl l)2004|) . how- 
ever, this finding — plus the well known fact that the typ- 
ical mass of halos depends on local density — implies that 
the clustering of halos of the same mass must also depend 
on formation time. Using a marked correlation func- 
tion. lSheth fc Tormenl §2004) found that close pairs tend 
to have earlier formation times than more distant pairs, 
work which was extended and confirme d bv lHarker et al.l 
( 2006). iGao. Springel fc White! IpTOl found that later 
forming, low-mass halos are less clustered than typical 
halos of the same mass at the present; a possible explana - 
tion of this resu l t was given bv lWang. Mo fc Jind {2006) . 
iWechsler et alJ l|2006fl found a similar dependence upon 
halo formation time, showing that the trend reversed for 
more massive halos and that the clustering depended on 
halo concentration. However, in order to probe to higher 
masses these authors assumed that the mass dependence 
was purely a function of the mass in units of the non- 
linear mass, then used earlier outputs to probe to higher 
values of this ratio. It should be noted that scaling quan- 
tities by M/M* gives a direct equality only if clustering is 
self-similar. Since P(k) is not a power-law and f2 ma t 7^ 1, 
a check of this approximation, as is done here, is crucial. 

These formation time dependencies are based on (usu- 
ally smooth) fits to the accretion history of the halo. 
However, halo assembly histories are often punctuated 
by large jumps from major mergers that have dramatic 
effects on the halos. Major mergers can be associ- 
ated with a wi de variety of phenomena, ran ging from 
quasar activity iKauffmann fc Haehnerfl|20 'OOP and star- 
bursts in galaxies l|Mihos fc Hernauist 1996) t o radio ha,- 
los and relics in galaxy clusters (see e.g. Sarazin 
for phenomena associated with galaxy cluster mergers). 
Major mergers of galaxy clusters are the most ener- 
getic events in the universe. It follows that major 
merger phenomena can either provide signals of inter- 
est or can cause noise in selection functions that de- 
pend upon a merger-affected observable. If recently 
merged halos cluster differently from the general pop- 
ulation (merger bias), and this is unaccounted for, con- 
clusions drawn about halos on the basis of their clus- 
tering would be suspect. The question of whether such 
merger bias exists remains unresolved, as previous work 
to identify a merger bias through N-body simulations and 



analytic methods yi elds mixed results l|Gottlober et all 
20021 iPercival et al]l20"03t iScannanieco fc Thackerll2003 
Furlanetto fc Kamionkowskill2005|) . 

In this paper we consider the clustering of the most 
massive dark matter halos, measured from two large vol- 
ume (1.1 /i _1 Gpc) 3 N-body simulations described in fJ3 
We concentrate on massive halos, as most previous sim- 
ulations did not have the volume to effectively probe this 
end of the mass function, and furthermore, for the largest 
mass halos the correspondence between theory and ob- 
servation is particularly clean. We first examine the long- 
term growth history of halos, calculating the "assembly 
bias" as a function of growth history in extending 
previous results mentioned above to higher masses. We 
then look to short-term history effects (i.e. events), mea- 
suring the "merger bias" as a function of recent major 
merger activity or large mass gain in §5\ where we find a 
weak, but statistically significant, signal for both cases. 
We conclude in SjHl 

2. SIMULATIONS 

To investigate the effects of formation history on clus- 
tering statistics we use two high resolution N-body sim- 
ulatio ns performed with indep endent codes: the HOT 
code (iWarren fc Salmon! I1993J) and the TreePM code 
( White] l2002jh Both simulations evolved randomly gen- 
erated, Gaussian initial conditions for 1024 3 particles 
of mass 10 11 /i _1 Af© from z — 34 to the present, us- 
ing the same ACDM cosmology (01 m = 0.3 = 1 — 51a, 
n B = 0.046, h = 0.7, n = 1 and er 8 = 0.9) in a pe- 
riodic, cubical box of side l.l/i _1 Gpc. For the HOT 
simulation a Plummer law with softening 35 ft. _1 kpc (co- 
moving) was used. The TreePM code used a spline soft- 
ened force with the same Plummer equivalent softening. 
The TreePM data were dumped in steps of light cross- 
ings of 136/i _1 Mpc (comoving), producing 30 outputs 
from z w 3 to z = 0. The HOT data were dumped 
from z w 1 (lookback time of 5.3/i _1 Gyr) to z = in 
intervals of 0.7/i~ 1 Gyr, with the last interval at z = 
reduced to 0.4/i _1 Gyr. The outputs before z rs 1 had 
so few high mass halos that the statistics were not use- 
ful for the merger event calculations. For comparisons 
of how using light crossing s vs. fixed time s teps i n Gyrs 
changes merger ratios, see IGohn fc Whit^ HoHS). The 
TreePM simulations were used for the assembly histories 
and the HOT simulations for the merger bias calcula- 
tions - though the results from the two simulations were 
consistent so either could have been used in principle. 

For each output we generate two catalog s of halos 
via th e Friends-of-Friends (FoF) algorithm l|Davis et alJ 
1985), using linking lengths b = 0.2 and 0.15 in units 
of the mean interparticle spacing. These groups corre- 
spond roughly to all particles above a density thresh- 
old 3/ (27T& 3 ), thus both linking lengths enclose primarily 
virialized material. Henceforth halo masses are quoted 
as the sum of the particle masses within FoF halos, thus 
a given halo's b = 0.15 mass will be smaller than its 
b = 0.2 mass Csee lWhitel ll2"ooT|) for more discussion). We 
consider halos with mass M > 5 x 10 13 h~ 1 M & (more 
than 500 particles); at z = there are approximately 
96,000 such halos in each box for the b — 0.15 catalog 
and 120, 000 for the b — 0.2 catalog. The mass func- 
tions and merger statistics from the two simulations are 
consistent within Poisson scatter. 
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Given a child-parent relationship between halos at 
neighboring output times, construction of the merger 
tree is straightforward since we are tracking massive ha- 
los rather than e.g. subhalos. Progenitors are defined 
as those halos at an earlier time which contributed at 
least half of their mass to a later (child) halo. Of the 
approximately 10 5 halos at z = we find only 14 for 
which our simple method fails. In these cases a "fly-by" 
collision of two halos gives rise to a halo at z — with 
no apparent progenitors. Excluding these halos does not 
change our results. For the TreePM run, we use all 30 
outputs to construct the merger tree, which stored all 
of the halo information (mass, velocity dispersion, posi- 
tion, etc.) for each halo at each output. Each node of 
the tree pointed to a linked list of its progenitors at the 
earlier time, enabling a traversal of the tree to find mass 
accretion histories and mergers. The HOT run produced 
outputs for each time interval of child and parent halos. 

3. MEASURING CLUSTERING 

A basic measure of clustering is the two-point func- 
tion, which in configuration space is the correlation func- 
tion, £(r). To comp ute £(r) we use the method of 
lLandv fc Szalavl lfl99^ : 



(DD) - 2(DR) + (RR) 
(RR) 



(1) 



where D and R are data and random catalogs, respec- 
tively, and the angle brackets refer to counts within a 
shell of small width having radius r. In computing (DR) 
and (RR) we use 10 x as many random as data points. 
To compute errors, we divide the simulation volume into 
8 octants and compute £(r) within each octant. Since 
we probe scales much smaller than the octants, we treat 
them as uncorrelated volumes, and we quote the mean 
£(r) and error on the mean under this assumption. These 
errors tend to be ~ 1.4-2 times larger than the more ap- 
proximate ^/iVpair error estimates used in some previous 
work. 

Our goal is to test the dependence of the clustering 
of objects associated with some history dependent prop- 
erty. A relevant quantity for comparison is the (mass 
dependent) bias of the halos relative to the underlying 
dark matter, which we define as: 



£,(r) = 6 2 £ dm (r). 



(2) 



Analytically, the large-scale bi as is related to a deriva- 
tive of the halo m ass function l l Efsta thiou et al.| |198 
Cole fc Kaiserll989HMo fc Whiteil99ftlSheth fc To~._ 
1999ft . For the Shcth-Tormcn form of the mass function 
one finds 



b S T{M impb ) = 1 + 



v ,A - 1 



0.6 



5c (1 



,/0.6\ 



(3) 



where v' = OMlS c /a(M 160pb ) and S c = 1.686. This has 
been i mproved upon using the Hubble volume simula- 
tions iJColbe^^^^j2002jHaman^^3 llO^) — see 
also ISeliak fc Warred ^2004^ for discussion of the bias 
define d through P(k) on similar scales. Ha mana et al.l 
(200 j) used FoF halos with b = 0.164 and found 



b{M, R,z) = b ST {M W8l z) 



H01 bias 
S-T bias 




10" 



Mass [M /h] 



x [1.0 + b S T{M W8 ,z)a R {R,z)} 



0.15 



(4) 



Fig. 1.— The bias b(r) = v^M/^dmO) at r = 18/i _1 Mpc for 
two different binnings in mass. The horizontal error bars on each 
point show the range of masses used. The bias was approximately 
scale-invariant in this mass regime from 15 — 30 fe^ Mpc. We show 
2 fits to b(M) pro posed in the literature: that of Haman a et alJ 
i200ll) (dashed) and lSheth fc Tormenl <1999l) (dotted), each plotted 
for both mass binnings. 

The subscripts on the mass M indicate which overdensity 
threshold is being used to define the halo mass. We took 
M = 0.93 Mios and M = 1. 07M 18 nh, calculating the 
conver sion using the profile of iNavarro. Frenk fc W hite 
l)1997|) assuming a concentration c = 5. The change in 
conversion factor was less than a percent for the range 
of concentrations of interest. See IWhitel |2001) for more 
details, discussion and definitions. 

We show the bias b = yj £(r) / £ dm (r) at r = 18 /i _1 Mpc 
as a function of mass in Fig. ^ The bias for halos with 
M > 5 x 10 13 /i _1 AfQ changed less than 5% on scales 
r > 15 h~ 1 Mpc. We include the t wo bias fits given above 
for r = 18/i- x Mpc at z = 0. The lHamana et alJ (2001) 
fit was derived from a larger simulation volume; Fig. is 
included to illustrate the mass dependence of the global 
bias, to provide a comparison context for the sizes of the 
additional biases of concern in this paper. We now turn 
to estimates of bias effects due to the history of the halos. 

4. ASSEMBLY BIAS 

We begin by considering parameterizations of the for- 
mation history of halos which emphasize the global 
properties, i.e. those related to the halo mass growth 
over a long period of time. We consider three pa- 
rameterizations of halo histories which have previ- 
ously be en used with lower mass halos: c, ai/2, 
and a t iWechsler et all 120061: iGao. Sprineel fc White! 
2005| iSheth fc Tor menll2004D. U s ing these parameter - 
izations ISheth fc Tormenl (120041) . Marker et alJ ( 1 20061) . 
IGao Sprineel fc White! i2005t). IWechsler et all i fcooqi . 
and lCroton. Gao fc WhiteT (2006) have shown that the 
clustering of halos of fixed mass is correlated with "for- 
mation time" , a result which has come to be termed as- 
sembly bias. The effect is strongest for smaller halos, and 
this has been the focus of earlier work. For the extremely 
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Fig. 2. — Correlation function of the lowest (filled triangles) and highest (open squares) quartiles of (reduced) concentration, c (left), 
half mass scale factor, (center) and formation scale factor, af (right). The solid line is £(r) for the full halo sample. Top panels: 

10 14 h~ 1 M Q < M < 3 X 10 14 h- 1 M (3 (31551 halos). Bottom panels: 5 X 10 13 h^ 1 M Q < M < 8 X 10 13 h~ 1 M (43638 halos). A clear 
signal is seen for concentration and formation scale factor for the more massive halos. 



massive halos that we consider halo identification is sim- 
pler, as none of our halos are subhalos. However, since 
massive halos are rarer, the statistics are poor even for a 
simulation volume as large as ours. 

The concentration, c , is a parameter in an NFW fit to 
a halo density profile ijNavarro. Frenk fc Whitd lT997') 7 . 
We perform a least squares fit of the NFW functional 
form to the radial mass distribution of all the particles 
in the FoF group, allowing c and M200 to vary simulta- 
ne ously. This is in ord er to be similar to the procedure 
of iBullock et alJ l)2001|) to allow ready comparison. The 
concentration is expected to correlate with the time by 
which most of the halo formed (earlier forming halos ar e 
more concentrated, se e iNavarro. Frenk fc White! |l996); 
iWechsler et"aTI l|2002D : lGao et alJ <|2004[n . There is also 
a weak dependence of concentration on halo mass. We 
have tried to minimize this effect by dividing out the av- 
erage concentration for each mass (calculated from the 
data) to get a "reduced" concentration, which is essen- 
tially uncorrelated with mass (correlation is less than 
0.2%). 

The second parameter encapsulating the formation his- 
tory is ai/2, the scale factor at which a halo accumu- 

7 W e follow NFW and take c = r20o/^si note that Wcchsler et al. 
(2006) use c v ; r = r v j r /r s where r v ; r ~ rioo for our cosmology. At 
2 = 0, c v j r ~ 1.25 c. 



lates half of its final mass. We find a X f 2 by linearly 
interpolating between the two bracketing times. Ana- 
lytic properties of this definition have been studied in 
iSheth fc Tormenl (J2004), and a 1 / 2 is often used as a proxy 
for formation epoch. The third parameter, a/, the for- 
mation scale factor, is also a formation time proxy. It is 
defined through a fit to the halo mass accretion history 
iWechsler et al.ll200l 8 : 

M(z) = M exp [-2a f z] , (5) 

where Mq is the mass of the halo at z = 0. We cal- 
culate this from the history by doing a least squares fit 
of ln(Mi/Mo) against Zi for all the Zi steps. Although 
this form does not fit the mass accretion history of mas- 
sive halos particularly well due to their frequent mergers, 
the fit is well defined and, as will be shown below, a,f 
nonetheless appears to be correlated with clustering. 

The correlations 9 for many of the above parame- 
ters were presented in iCohn fc White! ()2f)05j) . Some 
of these correlation s have been c o mpar e d in differ- 
ent combinations in lWechsler_e^^y_j2002j^, iZhao et "all 
(|2003a|) . IZhao et alJ po"03bj) . IWechsler et all pOO^ . and 

8 IMiller et alJ 120061) present an analytic justification for (his 
form based on extended Prcss-Schcchter theory. 

9 Defined as (jab) - (a) (&)) / sj ((a ~ (a)) 2 )((b - (b)) 2 ), see 
e.g. lLuptonl Il993) . 
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ICroton Gao fc White! l)2006j) . Except for iZhao et all 
(2003aJ3), these were for galaxy scale halos rather than 
galaxy cluster scale halos. The formation histories for 
low mass h alos tend to be smoot her and better fit to 
the form of iWechsler et all <H)02), since they undergo 
fewer mergers than high mass halos at late times. Wech- 
sler and Zhao give a formula f or the concentration i n 
terms of the formation time of IWechsler et all {2002) ; 
our correlation coefficient is characterizing the scatter 
around any such correlation. For the current sample 
the strongest correlation (0.69) is between the forma- 
tion redshift, Zf = 1/a/ — 1, and the half-mass redshift, 
z i/2 = V«i/2 — 1) c onsistent with the 0.70 found by 
iCohn fc White! l)2005l) with a sample about 1/7 the size. 
The formation redshift, Zf, and reduced concentration 
have a correlation of 0.53. The full concentration and 
z i/2 ( z f) have a correlation of 0.56 (0.54). These corre- 
lations increase as the lower mass limit is decreased from 
10 u h^Mg to 5 x 10 13 hr l M Q . 

To highlight any effects of assembly bias we take 
the highest and lowest quartiles of the distribution of 
each of these three parameterization values and com- 
pare th e resulting £(r) t o that of the full sample (sim- 
ilar to Wec hsler et alJ 1)200 61'). We show examples 
for 10 14 /i- x M < M < 3 x 10 14 h- 1 M Q and 5 x 
lO^/i^Mg < M < 8 x 10 13 Ii^Mq in Fig. For the 
higher mass halos we see a strong dependence of clus- 
tering on concentration. We see a similar, but notice- 
ably smaller, dependence on a/, indicating that more 
recently formed objects cluster more strongly. As all 
of the objects we consider have M > M„ our results 
are in line with the expecta tion oflWechsler et alJ (l2006f) 
and th e theoretical model o flFurlanetto fe Kamion kowskil 
(12005ft. Specif i cally, this confirms the result found by 
IWechsler et al.l l)2006l) at z = 0, without needing to make 
the approximation that b(c, M, z) = b(c, M /M*). 

The ratio of their correlation function at their top c 
quartile to the total sample for halos ~ 10 M* was ~ 1.25. 
This is larger than our ratio, which doesn't reach 1.2 for 
any of the radii considered in Fig. |3 though it is well 
within our (and their) errors. This is mirrored for the 
lowest c quartile where our effect is similarly reduced 
but within the errors. We are using reduced concentra- 
tion, while they divide each halo's concentration by the 
average concentration in its mass bin, c v i r - For the lower 
mass sample a much weaker trend is seen (e.g. the ratios 
for the quartiles when selected on concentration barely 
reaches 10%), agreeing with the expectation that the sig- 
nal decreases as M — > M* . At fix ed mass, the trend of b 
with c is consistent with the fit of lWechsler et all j2006), 
but the trend is so weak relative to the noise that the 
r esult is of marginal signi fic ance. 

IGao. Soringel fc White! J2005T) and Marker et all 
l)2006|) found bias for M > M* based on Zi/ 2 , where 
both the lowest and highest quartiles of zi/% tended 
to be more clustered than the full sample. We see 
a hint of this as well , but the fluctuations are large. 
ICroton. Gao fc White! l|200fiT) also found more depen- 
dence of clustering on z%/2 (their formation time) than 
on concentration, once luminosity dependent bias was 
taken out. Note that their luminosity dependence might 
include some of the history measured by concentration 
or z%/2 and their focus was on galaxies populating the 



halos rather than the halos themselves. 

Note also that even though Zf and Z1/2 are correlated, 
the correlation is not strong enough so that bias in one 
implies bias in the other. The overlap of the upper and 
lower quartiles for these quantities for M > 10 14 /i _1 Mq 
is 62% and 54% respectively. As the rest of the clusters 
differ, the overall biases can be quite different, as seen in 

Fig.m 

Another formation time related quantity, the redshift 
of last mass jump by 20% or more in a time step cor- 
responding to the light crossing time of 136 ft, -1 Mpc co- 
moving, had correlations with Z\/2 (0.70), Zf (0.61), and 
c (0.40). We found a small sign of bias in the correlation 
functions of its highest and lowest quartiles as well, lead- 
ing us to expect a merger bias signal, as will be examined 
in Sg| 

In summary, we confirm and extend previous results 
to lower redshift and higher mass for concentration de- 
pendent bias. We see a smaller signal for formation time 
bias, and we see very little (if any) signal for bias based 
on when halos reach half of their mass. Bias in concen- 
tration and half-mass redshift have been seen in previous 
work for smaller masses at higher redshift; our results 
show a smaller bias, but well within errors, at least for 
the concentration dependent bias. 

5. MERGER BIAS 

In the previous section we demonstrated the depen- 
dence of £(r) upon halo formation history, characterized 
by an average property such as the "formation time" . As 
halo assembly histories are punctuated by large jumps 
from major mergers, we can also ask whether the clus- 
tering of recently merged halos differs from that of the 
general population. 

Although the concept of a major merger is intuitively 
easy to understand, there is no standard definition in the 
literature of "merger" or "major merger" (these terms 
will be used interchangeably henceforth). In simulations, 
where the progenitors can be tracked and masses mea- 
sured, major mergers can be defined in terms of masses 
of the progenitors and the final halo. We define progen- 
itors as those halos at an earlier time which contributed 
at least half of their mass to a later halo at the time 
of interest. The three most common ways to define a 
halo merger are: (1) the mass ratio of the two largest 
progenitors, M2/M1 < 1 (2) the same ratio, but using 
the contributing mass of the two most mass-contributing 
progenitors, and (3) Mf /Mi, the ratio of the current halo 
mass to the total mass of its largest progenitor at an ear- 
lier time. We also consider (4) Mf/Mi, the ratio of the 
current halo mass to the largest contributed mass. In 
our simulations the merger fraction per 0.7/i _1 Gyr with 
M 2 /Mi > 0.3 increases by more than a factor of 3 from 
z = to 1. 

One way to quantify how well the two body criteria 
(M2,M% and Mf,Mi) describe the halo growth is to 
consider the ratio (Mi + M 2 )/Mt, This ratio is 1 for 
a halo formed only from its two largest predecessors: a 
two body merger with no other accretion. It is lowered 
by accretion or multi-body mergers. Fig. [3] shows the 
cumulative distribution of (Mi + M 2 )/Mf for halos with 
M > 10 ^h -1 M® satisfying a variety of merger crite- 
ria. We considered both cases where Mi and M 2 are 
the full and contributing progenitor masses. As can be 
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Fig. 3. — The cumulative distribution of (Afi + M 2 )/Mf for 
different subsamples of our b = 0.15 halos at z = 0. Looking back 
0.4 h — *Gyr the subsamples are defined by Mf/Mi > 1.5, 1.2 or 
M2/M1 > 0.3,0.1. The lines are in the same order, top to bottom, 
with the lowest line being the full sample. At left Mi, M2 are the 
full masses of the two largest progenitors, at right Mi , M2 refer to 
the contributing mass. 

seen on the right, for all halos with M > 10 14 /i _1 M at 
z = 0, considering mass gains within the last 0.4 ft, _1 Gyr, 
at least 5% of the final halo mass is not from the two 
largest contributors. As the merger criteria is hardened 
(i.e. the merger is more "major"), the two largest pro- 
genitors contribute less and less of the final mass. As 
can be seen on the left, the same amount of mass as 
found in the two largest progenitors makes up the en- 
tire mass of the final halo in ~25% of the full sample of 
halos. Lengthening the time step or looking to higher 
redshift also increases the fraction of halos getting their 
mass from halos other than the two largest progenitors. 
For simplicity, our subsequent analysis uses only the two 
body criteria to define mergers, so the accuracy of this 
assumption as examined above should be kept in mind. 

Previous work to identify a merger bias through N- 
body simu lations and anal y tic m ethods gives a mixed 
picture. IGottlober et alJ l)2002f) found a clustering 
bias for recently (At = 0.5 Gyr) merged objects with 
Mf/Mi > 1.25 and M < M* at z = 0. These au- 
thors, however, did not try to match the mass distri- 
bution of the comparison sample to that of the merged 
halos — a problem since mergers occur more often for 
more massive halos, and the bias is known to increase 
with halo mass. To isolate the effects due to merging, 
the comparison sample needs to have the same mass dis- 
tribution as the merge d sample, and most s ubsequent 
work has ensured this. iPercival et alJ l)2003|) found no 
bias between the correlation functions of recently merged 
(At = 10 8 yr, M 2 /Mi > 0.3) and general samples at 
z = 2 for halos with M ~ M*, 25M», and 150M*. 
iScannanieco fc Thackerl l)2003fl confirmed Percival et al.'s 
results for major mergers in a z = 3 sample for a smaller 
range of masses, but surprisingly found an enhancement 
of clustering for halos with recent (At = 5 x 10 7 , 10 8 yr) 



large total mass gain, Mf/Mi > 1.20. That is, they find a 
bias when selecting halos with a recent large mass gains, 
but not when selecting on recently merged halos' parent 
masses. Their signal was weak due to limited statistics. 

That the previous literature is inconclusive is to be ex- 
pected, given that the effects of merger history upon clus- 
tering are small, and extremely difficult to measure nu- 
merically. We expect the largest signal when M ^ M*, 
but this is where the number density of objects is small- 
est. In addition, the most extreme mergers are the rarest, 
increasing the shot-noise in the measurement of £(r). 
If we include more common events, the "merged" and 
"comparison" samples become more similar, washing out 
the signal of interest. At higher redshift, the merger rate 
increases, thus the merged and comparison samples have 
more overlap unless the merger ratio is increased, leading 
to worse statistics. To try to overcome these statistical 
effects, we use our very large samples of simulated halos 
to search for a merger, or temporal, bias. 

To define a "recent major merger" requires both a 
choice of threshold for one of the merger ratios and a 
choice of time interval. As we expec t the halo cross- 
ing ti me to be ~ 0.7/i~ 1 Gyr. fe.g. iTasitsiomi et alJ 
(200 j; IGottlober Klvpin & Kravtsovl (|2001jk 

iRowlev. Thomas fc Kavl l|2004|) K we expect that 
outputs at this separation or shorter are small enough 
to catch recently merged halos while they are still 
"unrelaxed". That is, a "recent merger" might be 
expected to correspond to a dynamically disturbed halo. 

We consider the four merger criteria mentioned above, 
as well as a wide range of samples and merger definitions. 
We used 9 different time intervals from z ~ 1 to z — 
as given in |21 We considered 4 different thresholds for 
both Mi /Mi and Mf /Mi using both total and contribut- 
ing mass of the progenitors: M 2 /Mi > 0.1,0.2,0.3,0.5 
and Mf/Mi > 1.2,1.3,1.5,2.0. Furthermore, we used 
two minimum masses, 5 x 10 13 h~ 1 M Q , 10 14 H~ 1 Mq, and 
two FoF linking lengths, b = 0.15,0.2. Combinations of 
each of these criteria resulted in over 700 different pairs 
of "merged" and "comparison" samples. Although this 
data set is very rich, systematic trends are difficult to 
identify. This is in part because increasing the merger 
"strength" simultaneously increases the noise (due to 
lower numbers of events). 

Evidence of bias is very slight in the binned £(r). We 
used three methods to try to isolate the signal: the 
marked correlation function, the integrated correlation 
function, and a likelihood fit to a power law for the cor- 
relation function. The clustering and merger criteria in- 
fluence these three quantities in distinct ways. We now 
describe each method, and our corresponding results, in 
turn. 

5.1. Marked correlation function 

One problem with computing merger effects in 
terms of £(r) is that, to compute the difference in 
clustering of merged and random samples, one must 
define a boolean merger criterion — a halo is either 
in the merged sample or not. As halo histories are 
complex, a more nuanced measure of merger cluster- 
ing is useful, and this can be provided by using the 
marked correlation function llBeisbart fc Kerscherll2000l: 
i Beisbart. Kerscher. fc Meckel l2002t IGottlober et all 
120021 ISheth fc Tormenl 120041: Marker et all 1200ft 
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Fig. 4. — The marked correlation function for halos in the range 
5-7 X 10 13 h -1 Mq at z = 0. The mark is the maximum progenitor 
mass ratio, M2/M1, within the last 1 h~ Gyr. The error bars come 
from dividing the sample into 8 octants. 



IShetti. Connolly, Skihhl 12001 . Each of N objects 
gets assigned a mark, m„ for i = 1,...,N. Denoting 
the separation of the pair by r-y, the marked 

correlation function, M(r), is defined by 



M(r)=J2 



rriimj 
n(r)fh 2 



(6) 



where the sum is over all pairs of objects (i, j) with sepa- 
ration Tij — r, n(r) is the number of pairs, and the mean 
mark, m, is calculated over all objects in the sample. The 
marked correlation function "divides" out the clustering 
of the average sample, and thus a difference in clustering 
is detected for M(r) 7^ 1. 

We consider five marks: M2/M1 (for both total and 
contributed masses), Mf /Mi, Mf/M\ (where Mi is con- 
tributed mass) and |(1 + M2/M1). The last case had 
a smaller range of marks, and thus tests sensitivity to 
extreme events. The results for this mark were similar 
to the others, suggesting that we are not dominated by 
outliers. Halos are chosen with mass in a narrow range, 
M m i n < M < \/2M m i n , to minimize the previously men- 
tioned bias due to merged halos being more massive. The 
global bias changes less than a percent over the mass 
ranges we consider. 

In our combined sample of several output times and 
mass ranges, the largest signal comes from using as mark 
the maximum value of M2/M1 within At of the present, 
as shown in Fig. 0] As At was increased the signal went 
smoothly to zero. We find similar behavior for M^jiMi + 
M2), which suggests that any bias is contributed by the 
systems where Mi <C M\. The signal is extremely weak 
for the other marks we considered. By stacking the signal 
across multiple output times (see M5.3l for details) we are 
able to find small, but statistically significant detections 
of excess power for the marks M 2 /Mi, M 2 /(Mi + M 2 ), 
and Mf/Mi, for halos near 5 x 10 13 /i _1 M©. At higher 
masses there is weak evidence for an effect, but the large 
error bars weaken the statistical significance. 

As the marked correlation function approach finds only 
a weak signal, typically an enhanced clustering of order 
5-10%, we also explore two indicators which character- 
ize the correlations by fewer parameters: the integrated 



correlation function observed at a single scale, and a like- 
lihood fit to a power law correlation function. 

5.2. Integrated correlation function 

Given an object at some position, the integrated cor- 
relation function 

£(r) = -| [x 2 Z{x)dx (7) 

is the probability, above random, that a second object 
will be within a sphere of radius r. This quantity en- 
hances any increased clustering at short distances, but 
gives error bars that are even more highly correlated than 
those of the correlation function, £(r), itself. A typical 
result is shown in Fig. where a significant signal can 
be seen. As in the previous section, we find a weak signal 
regardless of merger definition in our 700 plus samples. 
Considering all the samples and all the separations r, 
more than 2/3 of the time the difference £mergeM — £aii( r ) 
was positive. 

This method separates the data into radial bins, re- 
quiring us to estimate the clustering at many locations. 
Since the errors on the binned correlation points are 
highly correlated, we reduced £(r) to a single measure- 
ment by fixing a preferred scale. The signal tends to be 
largest near r = 20/i _1 Mpc (though the signal is largest 
at r = 30/i _1 Mpc in the examples in Fig. 0, and so we 
compare £(r) of the merged and general samples at this 
radius. On average, when a 2<r signal is seen (5-15% 
of the time, depending on mass ratio, etc.), £(r) for the 
mergers is ~20% higher than for the general sample, al- 
though in extreme cases the difference can be as large as 
a factor of 2 or 3. Due to the noisy statistics it was hard 
to identify any clear trends. 

5.3. Likelihood fit to ro 

The integrated correlation function sums all pairs 
within a spherical region. As an alternate approach, we 
approximate the correlation function as a power law over 
some range of radii, and we perform a likelihood fit to 
this power law correlation function: 



min 3 ' mi ax 



(8) 

). This method in- 



C(r) = 

over the range of scales (1 
corporates information from many scales, similar to the 
integrated correlation function. However, it is combined 
with the expectation that the correlation function should 
be a power law and excises the center region. By us- 
ing the positions of the halos directly in the fit to the 
likelihood, the errors differ from those in the integrated 
correlation function as well. 

Assuming that the pair counts form a Poisson sample 
with mean proportional to 1 + £( r), the likelihood L is 



witn mean proportional to 1 + £;(r), tn 
iJCroft et al.lll997t iStephens et al.lll997|) 

In L(r ) = -2tt n 2 j r 2 [1 + f(r)] dr 



+ In (n 2 ^ [1 + £(r w )]) + const , (9) 

i<j 

where the sum is over measured pairs i , j with separation 
ri j , n is the measured average density 10 , and £(r) is given 

10 We find that marginalizing or maximizing over n as a free 
parameter results in biased fits for several samples. 
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Fig. 5. — The integrated correlation function, £(r), of recently (within 0.4 h x Gyr) merged halos (triangles) and a comparison sample 
of the same mass (squares) for M2/M1 > 0.1 (left), M2/M1 > 0.2 (middle), and Mf /Mi > 1.2 (right), where M\,Mi are the full masses 
of the progenitor halos, for halos in our b = 0.15 catalog at z = 0. The number of halos that merged out of the 96319 total halos with 
M > 5 X 10 13 fc _1 M0 is shown at upper right in for each case. For these three examples, the differences between the two samples are 
largest at 30 h~ 1 WLpc, with significance 3.1c (left), 2.7a (middle), and 2.5<r (right). 
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Fig. 6. — (Left) The correlation function for a recently merged sample (triangles) and a comparison sample (squares) of the same mass. 
The lines indicate the best-fit 7 = 1.9 power law model, fit directly to the cluster positions (not the binned §(r)). (Right) The likelihood 
for the clustering amplitude, ro, assuming a slope 7 = 1.9 for the same samples at left. The sample is at 2 = 0, with a minimum mass of 
5 X 10 13 k _1 M0 (6 = 0.15), looking back 0.4/i _1 Gyr. Mergers are tagged as having M2/M1 > 0.2, M±,M2 full progenitor masses. 
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Fig. 7. — The (scale-independent) ratio of the power law 
fit correlation functions for the merged and comparison sam- 
ples, as a function of lookback time/redshift; At = 0.7h~ x Gyr 
(triangles), 0Ah~ 1 Gyi (square). The mergers satisfy the crite- 
rion M2/M1 > 0.2, with M2,M\ total progenitor mass, for the 
M > 5 x 10 13 h~ 1 M Q halos in our b = 0.15 catalog. No evidence 
of systematic bias evolution with redshift is found. The enhanced 
clustering at z = arises presumably from the shorter time interval 
used. 

by Eq. JSJ. We fit over the range 5-25 h~ 1 Mpc, where 
the correlation function exhibits an approximately power 
law behavior. For the comparison sample we multiply 
the likelihoods for several different realizations, to reduce 
the noise, and then renormalize to unit area. A typical 
result, where a significant signal can be seen, is shown 
in Fig. |HJ demonstrating both the power law fit and the 
maximum likelihood distribution. For the fits, ro was 
usually ~ f0/i _1 Mpc, within the range where the power 
law fit was being applied. 

Across all of our samples, we find 7 ~ 1.9 ± 0.1. To 
allow us to compare different samples more easily, we re- 
duce the number of free parameters to one by holding 
7 = 1.9. A typical example, demonstrating the ratio 
of the power law fit correlation functions of the merged 
and general sample, is shown in Fig. [7] as a function of 
lookback time/redshift. Since we fix 7 = 1.9 for both 
the merged and general sample, the ratio £ mer ge/£aii us- 
ing Eq. (|SJ) is scale-invariant within our fit range. While 
the enhanced clustering of the recently merged sample is 
small, it remains statistically significant. Typically, the 
merged sample shows an enhanced clustering of 5 — 10% 
in the correlation function for the 0.7/i _1 Gyr spacings, 
though we find no strong evidence of systematic bias evo- 
lution with redshift. Moreover, at z = 0, where the spac- 
ing is smaller (0.4 /i _1 Gyr), we find a significantly en- 
hanced ^(r) for the mergers, often 10—20%. Presumably, 
this increased clustering signal is caused by the smaller 
time interval. Larger intervals encompass more mergers, 
leading to smaller errors, but also leading to a smaller 
signal, since mergers now encompass a more significant 
fraction of the comparison population. As mentioned 
above, looking at earlier times also makes the merged and 
comparison population overlap increase dramatically. 

By averaging £ mer ge /£aii across all of the 0.7 '/i Gyr 
spacings from z « 1 to z = 0.04, we are able to study the 
size of the merger bias simply as a function of merger 
ratio. Figure [5] shows the increase of £ m erge/£aii with 
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Fig. 8. — The (scale-independent) ratio of the power law fit 
correlation functions for the merged and comparison samples as a 
function of merger ratio, M2/M1 (left points; full progenitor mass) 
and Mf /Mi (right points), for halos above 5 X 10 13 h~ 1 M@ in our 
b = 0.15 catalog. Mergers are counted within 0Ah —1 Gyx of 2 = 
(squares), and an average across all 0.7 h~ Gyr spacings from z w 1 
to z = 0.04 (triangles). In both cases, clear trends can be seen. 

M2/M1 (full mass) and Mf/Mi both for mergers within 
0.4/i _1 Gyr of the present and for the redshift-averaged 
0.7 h^ 1 Gyr spacings. The merger bias clearly increases 
with increasing merger ratio, with the smaller time step 
yielding stronger clustering as described above. 

In summary, we find a weak bias in many cases (but not 
all — the signals are very noisy) for recen t major merg- 
ers an d recent large mass gains. While Percival ct al. 
(2003) found no such merger bias, our signal is consis- 
tent with their upper limit of 20% on the bias effects 
of rec ent mergers. The work of iScannapieco fc Thackeil 
(2003) saw a small bias for large mass gains but noted 
that their statistics limited their ability to determine the 
significance. Our larger box allowed us to incorporate 
the effects of cosmic variance, which had been neglected 
in previous work. Cosmic variance increased the errors 
by 40% or more, which limited the significance of the 
signal. Nonetheless, we still found a small bias for both 
mergers and large mass gains. 

6. CONCLUSIONS 

The large-scale structure of the Universe is built upon a 
skeleton of clustered dark matter halos. For the past two 
decades we have known that rarer, more massive dark 
matter halos cluster more strongly than their lower mass 
counterparts. Halos of a fixed mass, however, can differ 
in their formation history and large-scale environment, 
and recent work on halos smaller than galaxy clusters 
has shown that this can lead to further changes in their 
clustering. 

In this paper we have used two large- volume, high res- 
olution N-body simulations to study the clustering of 
massive halos as a function of formation history. We 
confirmed earlier results that the lower concentration 
massive halos are more clustered than the population as 
a whole; extending these results to higher masses (and 
thus lower redshifts) than had been probed previously 
(Wcch sler et al.N2006|) . (Previous work had looked at 
similar regimes of M/M* but for smaller M and thus 
higher redshift; note again that exact scaling with M/M* 
is not expected for non power law P(k) and fl m 7^ 1.) 
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Similarly, we confirmed the enhanced clustering of halos 
with later formation times, though the signal was not as 
strong as for concentration. The signal for bias based 
on a h alo reaching half of its mass is weaker than that 
seen in lGao. Snringel fc White! 1)20051) (again for higher 
z), and not statistically significant in our case. 

We also investigated whether recent merger activity af- 
fected the clustering of massive halos — a topic with a 
muddied history in the literature. While we found statis- 
tically significant (> 2a) merger effects on clustering in 
many cases we considered, both for recent major merg- 
ers and large mass gain, in most cases this signal was 
weak: a 5-10% increase in bias. Our strongest signal 
came from using a likelihood fit of the correlation func- 
tion to a power law, particularly for major mergers within 
0.4 h^ 1 Gyr of the present, where we saw a typical merger 
bias of up to 20%. This bias signal is not necessarily 
at odds with the lack of signal in previous work, which 
looked for larger bias than that seen on average here. 

Even with a (1.1 /j _1 Gpc) 3 volume, massive halos re- 
main very rare objects and small changes in their cor- 
relations are difficult to detect. We were plagued by 
the competing effects that increasing the severity of the 
merger (and hence underlying signal) decreases the num- 
ber of pairs, worsening the statistics. General trends re- 
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